Monte Carlo Evaluation of Ordinal d with Improved Confidence Interval
نویسندگان
چکیده
منابع مشابه
Adapting Improved Upper Confidence Bounds for Monte-Carlo Tree Search
The UCT algorithm, which combines the UCB algorithm and Monte-Carlo Tree Search (MCTS), is currently the most widely used variant of MCTS. Recently, a number of investigations into applying other bandit algorithms to MCTS have produced interesting results. In this research, we will investigate the possibility of combining the improved UCB algorithm, proposed by Auer et al. [2], with MCTS. Howev...
متن کاملImproved Monte-Carlo Search
Monte-Carlo search has been successful in many non-deterministic games, and recently in deterministic games with high branching factor. One of the drawbacks of the current approaches is that even if the iterative process would last for a very long time, the selected move does not necessarily converge to a game-theoretic optimal one. In this paper we introduce a new algorithm, UCT, which extends...
متن کاملBrownian confidence bands on Monte Carlo output
When considering a Monte Carlo estimation procedure, the path produced by successive partial estimates is often used as a guide for informal convergence diagnostics. However the confidence region associated with that path cannot be derived simplistically from the confidence interval for the estimate itself. An asymptotically correct approach can be based on the Brownian motion approximation of ...
متن کاملAdaptive Markov Chain Monte Carlo Confidence Intervals
In Adaptive Markov Chain Monte Carlo (AMCMC) simulation, classical estimators of asymptotic variances are inconsistent in general. In this work we establish that despite this inconsistency, confidence interval procedures based on these estimators remain consistent. We study two classes of confidence intervals, one based on the standard Gaussian limit theory, and the class of so-called fixed-b c...
متن کاملMarkov Chain Monte Carlo Confidence Intervals
For a reversible and ergodic Markov chain {Xn, n ≥ 0} with invariant distribution π, we show that a valid confidence interval for π(h) can be constructed whenever the asymptotic variance σ P (h) is finite and positive. We do not impose any additional condition on the convergence rate of the Markov chain. The confidence interval is derived using the so-called fixed-b lag-window estimator of σ P ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Modern Applied Statistical Methods
سال: 2004
ISSN: 1538-9472
DOI: 10.22237/jmasm/1099267560